Labeler agreement in transcribing Korean intonation with K-ToBI
Abstract
This paper reports labeler agreement in the transcription of Korean prosody using Korean ToBI (K-ToBI) [9]. Twenty utterances representing five different types of speech were produced by 18 speakers and transcribed by 21 labelers differing in their levels of experience with K-ToBI. Following the stringent metric used for English ToBI evaluation [14,12], consistency was measured in terms of the number of transcriber pairs agreeing on the labeling of each particular word. The results show that for tonal transcriptions of the 32,130 transcriber-pair-words, agreement was 77% for the type of boundaries at the end of each word (i.e., word, AP, or IP), 78% for AP boundaries, and 91% for IP boundaries. For break indices, the agreement score for exact matching in the labeling was 59%, 69% when relaxing the presence/absence of diacritics, and 99% when relaxing within +/-1 level. In sum, the data confirm that the conventions of K-ToBI are adequate, easy to learn, and can be reliably used for research in Korean prosody and for large-scale prosodic annotation in speech databases.
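The pairwise metric described in the abstract can be illustrated with a minimal sketch. It counts, for each word, how many transcriber pairs assigned matching labels, with an optional relaxed match for break indices (within +/-1 level). The function names and the toy data are hypothetical and are not taken from the paper. The final lines check that 32,130 pair-words equals 210 pairs times 153 words if all 21 labelers transcribed every word; the 153-word figure is an inference from that assumption, not a number stated in the abstract.

```python
from itertools import combinations
from math import comb


def pairwise_agreement(labels_by_word, match=lambda a, b: a == b):
    """Count agreeing transcriber pairs over all words.

    labels_by_word: one list per word, holding one label per transcriber.
    match: predicate deciding whether two labels count as agreeing.
    Returns (agreeing pair-words, total pair-words).
    """
    agree = 0
    total = 0
    for labels in labels_by_word:
        for a, b in combinations(labels, 2):
            total += 1
            if match(a, b):
                agree += 1
    return agree, total


def within_one(a, b):
    """Relaxed break-index match: levels differ by at most 1."""
    return abs(a - b) <= 1


# Hypothetical toy data: break indices from 4 transcribers for 3 words.
toy = [
    [2, 2, 2, 3],
    [1, 1, 1, 1],
    [3, 2, 3, 0],
]

exact = pairwise_agreement(toy)
relaxed = pairwise_agreement(toy, within_one)
print(f"exact:   {exact[0]}/{exact[1]}")
print(f"relaxed: {relaxed[0]}/{relaxed[1]}")

# Consistency check on the abstract's count, assuming every one of the
# 21 labelers transcribed every word (an assumption, not a stated fact).
pairs_per_word = comb(21, 2)
implied_words = 32130 // pairs_per_word
print(pairs_per_word, implied_words, 32130 % pairs_per_word)
```

Passing the match rule in as a predicate lets the same counting loop serve the exact and relaxed scores, which mirrors how the abstract reports several agreement figures over the same set of pair-words.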
Similar resources
Rules for the generation of ToBI-based American English intonation
This study presents an approach to the generation of American English intonation based on prescriptive rules that define the respective features of certain tone labels that in turn represent linguistically relevant F0 configurations. In accordance with the principles of the Tone Sequence Model the F0 contour is analyzed as a series of discrete target values that are connected by means of transi...
Determining prominence and prosodic boundaries in Korean by non-expert rapid prosody transcription
This paper examines how non-expert listeners perceive prominence and prosodic boundaries in Korean using the Rapid Prosody Transcription (RPT) method, developed by Mo, Cole and Lee [9] for American English. While prominence is used to mark prosodically salient or “highlighted” words and phrases, prosodic boundaries demarcate units or “chunks” of speech to mirror the hierarchical relations among...
An autosegmental/metrical model of Chickasaw intonation
This chapter presents a formal model for transcribing the intonational properties of Chickasaw, a Western Muskogean language spoken in south-central Oklahoma by perhaps a few hundred speakers. The proposed model adopts many elements and assumptions of the autosegmental/metrical (AM) framework originally developed to analyse English intonational structure (Pierrehumbert 1980, Silverman et al. 19...
Matching a tone-based and tune-based approach to English intonation for concept-to-speech generation
The paper describes the results of a comparison of two annotation systems for intonation, the tone-based ToBI approach and the tune-based approach proposed by Systemic Functional Grammar (SFG). The goal of this comparison is to define a mapping between the two systems for the purpose of concept-to-speech generation of English. Since ToBI is widely used in speech synthesis and SFG is wide...
Intonation issues in HMM-based speech synthesis for Vietnamese
In an HMM-based Text-To-Speech system, contextual features, including phonetic and prosodic factors, have a significant influence on the spectrum, F0 and duration of the synthetic voice. This paper proposes prosodic features aiming at improving the naturalness of an HMM-based TTS system (VTed) for a tonal language, Vietnamese. The ToBI (Tones and Break Indices) features are used to learn two cru...
Publication date: 2000